MICHAEL CHRIST

Abstract. To any finite collection of smooth real vector fields $X_j$ in $\mathbb{R}^n$ we
associate a metric in the phase space $T^*\mathbb{R}^n$. The relation between the asymptotic
behavior of this metric and hypoellipticity of $\sum_j X_j^2$, in the smooth, real analytic,
and Gevrey categories, is explored.



To Professor P. Lelong, on the occasion of his 85th birthday, in admiration. 

1. Introduction

Let $\{X_j\}$ be a collection of real vector fields with $C^\infty$ or $C^\omega$ coefficients, defined
in a neighborhood of a point $x_0 \in \mathbb{R}^d$. Consider a second order differential operator
$L = \sum_j X_j^2$. Under what conditions is $L$ hypoelliptic, in $C^\infty$ or $C^\omega$? As is well known,
certain of these sums of squares operators provide a good model for aspects of the
$\bar\partial$-Neumann problem in $\mathbb{C}^2$, and a grossly oversimplified but still illuminating window
into the higher dimensional case. Indeed, application of the method of boundary
reduction to the $\bar\partial$-Neumann problem leads to a pseudodifferential equation on the
boundary; microlocalizing and composing with an elliptic factor then leads in $\mathbb{C}^2$ to
an operator closely related to $X^2 + Y^2$ for certain vector fields. We believe that the
study of more general collections of vector fields than those directly relevant to the
$\bar\partial$-Neumann problem has shed, and will shed, light on the special cases of interest in
complex analysis.

In a companion paper [7] we have established a sufficient condition for the $C^\infty$ case
which subsumes a variety of previously known results and examples. This condition 
involves the existence of auxiliary operators, having certain favorable commutation 
relations with L. 

Except in very special cases, there remains a large gap between the hypotheses
of the sufficient conditions for hypoellipticity in various function spaces, and
counterexamples suggesting what hypotheses are necessary. Many works on various types
of hypoellipticity and regularity have relied on commutation methods to achieve
results in the positive direction, whereas negative results have been attained through
different methods.

How closely is hypoellipticity linked with the commutation method? The aim of 
this note is to begin to directly investigate this question. We will: 

1. Define a metric $\rho_L$, adapted to $L$, on the cotangent bundle, and show how
a necessary condition for the existence of auxiliary operators having desirable
commutation properties with $L$ may be formulated in terms of $\rho_L$.

2. Speculate on the connection between properties of $\rho_L$ and hypoellipticity of $L$
in $C^\omega$, in Gevrey classes, and to a lesser extent in $C^\infty$.

3. Analyze the behavior of $\rho_L$ in fundamental examples.

A well known construction associates to the collection $\{X_j\}$ of vector fields a
Carnot-Caratheodory, or control, metric on the base space $\mathbb{R}^d$, rather than on the
cotangent bundle. This metric, and its connection with the fundamental solution of
$\sum_j X_j^2$, has been intensively studied by Fefferman and Phong [12], by Nagel, Stein and
Wainger [17], and by Sanchez-Calle [19]. A cotangent bundle metric has been defined
by Fefferman and Parmeggiani [11], [18]. Their metric differs from $\rho_L$ in fundamental
respects, as explained in §4 below.



2. Definitions 

Let $\mu : T^*\mathbb{R}^d \to (0,\infty)$ be any positive nonvanishing real symbol belonging to the
class $S^1_{1,0}$, and let $\{V_j\}$ be any finite collection of vector fields, defined on a conic
open subset of the cotangent bundle $T^*\mathbb{R}^d$, with real $C^\infty$ coefficients. Assume that
each $V_j$ takes the form $\sum_k a_{j,k}\,\partial/\partial x_k + \sum_k b_{j,k}\,\partial/\partial\xi_k$ where $a_{j,k} \in S^0_{1,0}$ and
$b_{j,k} \in S^1_{1,0}$.

Our first definition is made only to facilitate the main definitions below. 

Definition 1. A locally $C^1$ function $\psi : T^*\mathbb{R}^d \to \mathbb{R}$ is said to be Lipschitz relative to
$(\mu, \{V_j\})$ if at every point $p = (x,\xi)$ of $T^*\mathbb{R}^d$,

(1) $|V_j\psi(p)| \le \mu(p)$ for all $j \le n$

and

(2) $\mu(p)\langle\xi\rangle^{-1}|\nabla_x\psi(p)| + \mu(p)|\nabla_\xi\psi(p)| \le \mu(p)$.

The main condition here is (1), which imposes a stringent upper bound on the
variation of $\psi$ in the directions $V_j$, at places where $\mu(x,\xi)$ is smaller than $\langle\xi\rangle$ and some
vector field $V_j$ is relatively large. The common factor of $\mu(p)$ on both sides of (2) is
not a typographical error and will be explained below.

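Next, let finitely many $C^\infty$ real vector fields $X_j$ be given on an open subset of the
base space $\mathbb{R}^d$. Recall that to any smooth function $f : T^*\mathbb{R}^d \to \mathbb{R}$ is associated the
Hamiltonian vector field

$H_f = \sum_{n=1}^{d} \left( \frac{\partial f}{\partial x_n}\frac{\partial}{\partial \xi_n} - \frac{\partial f}{\partial \xi_n}\frac{\partial}{\partial x_n} \right)$.

Define $\sigma_j(x,\xi)$ to be the principal symbol of $iX_j$, which is real valued. Denote by
$H_{\sigma_j}$ the Hamilton vector field associated to $\sigma_j$.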

In most of the discussion we assume the bracket hypothesis to be satisfied, to some
order $m$. Then consider the set $S$ consisting of all multi-indices $I = (I_1, \dots, I_k)$ of
degree $|I| = k \in \{1, 2, \dots, m\}$ such that each $I_s \in \{1, 2, \dots, n\}$. For each $I \in S$ define
$\sigma_I(x,\xi)$ to be the iterated Poisson bracket $\{\sigma_{I_k}, \sigma_{I'}\}$ where $I' = (I_1, \dots, I_{k-1})$. Define
the effective symbol$^1$ $\sigma$ of $L$ to be

(3) $\sigma(x,\xi)^2 = \sum_{I \in S} |\sigma_I(x,\xi)|^{2/|I|}$.

The bracket hypothesis is equivalent to the existence of $C, \epsilon \in \mathbb{R}^+$ and $m$ such that

$\sigma(x,\xi) \ge C\langle\xi\rangle^{\epsilon}$ for all $(x,\xi)$.
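
As an illustration of (3), here is a worked computation for the operator $\partial_x^2 + (x\partial_t)^2$ on $\mathbb{R}^2$, which reappears as $L_1$ in Example 6. It is only a sketch: the symbols are written up to sign, which depends on the Fourier transform convention and is immaterial here because only absolute values enter (3).

```latex
% Requires amsmath. Coordinates (x,t;\xi,\tau); symbols are taken up to sign.
\begin{align*}
\sigma_1 &= \xi, \qquad \sigma_2 = x\tau, \qquad \{\sigma_1,\sigma_2\} = \pm\tau, \\
% every bracket of length three or more vanishes, since \tau is independent of (x,t)
\sigma(x,t;\xi,\tau)^2 &= |\xi|^2 + |x\tau|^2 + 2\,|\tau|^{2/2}, \\
\sigma(x,t;\xi,\tau) &\sim |\xi| + |x\tau| + |\tau|^{1/2} .
\end{align*}
```

The factor 2 comes from the two multi-indices $(1,2)$ and $(2,1)$ of length two. In particular $\sigma \ge c\langle(\xi,\tau)\rangle^{1/2}$ for large $|(\xi,\tau)|$, in agreement with the bracket hypothesis holding to order $m = 2$ and with the effective symbol quoted for $L_1$ in Example 6, up to the additive constant 1.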

The symplectic structure of the cotangent bundle is reflected in the next definition. 

Definition 2. A $C^1$ function $\psi$ defined on $T^*\mathbb{R}^d$ is said to be microlocally Lipschitz
relative to $\{X_j\}$ if it is Lipschitz relative to $(\mu, \{V_j\}) = (\sigma, \{H_{\sigma_j}\})$ where $\sigma$ is the
effective symbol defined in (3), and $H_{\sigma_j}$ is the Hamiltonian vector field associated to
the principal symbol $\sigma_j$ of $iX_j$.

With this definition, even when $L = \sum X_j^2$ is elliptic, (1) by itself does not force $\psi$
even to be continuous; when $L$ has constant coefficients, for instance, each $V_j = H_{\sigma_j}$
is everywhere in the span of $\{\partial/\partial x_i\}$. The auxiliary condition (2), which does not
depend on the vector fields $X_j$ at all because the same factor of $\mu$ occurs on both
sides, does ensure that $\psi$ is locally Lipschitz with respect to the standard metric

$d\rho_0^2 = \langle\xi\rangle^2\,dx^2 + d\xi^2$.

(2) is simply one of the inequalities defining the symbol class $S^1_{1,0}$, rewritten so as to
more closely resemble (1). (2) implies (1) when $L$ is elliptic, but not in general.

Our main definition is as follows. Associated to $\{X_j\}$, via $\sigma$ and the collection of
Hamiltonian vector fields $\{H_{\sigma_j}\}$, is a metric $\rho$ on $T^*\mathbb{R}^d$.

Definition 3. Let $X_j$ be real $C^\infty$ vector fields in an open subset of $\mathbb{R}^d$. Then

(4) $\rho_{\{X_j\}}(p,q) = \sup |\psi(p) - \psi(q)|$,

where the supremum is taken over all functions $\psi$ that are microlocally Lipschitz
relative to $\{X_j\}$. To $L = \sum X_j^2$ is associated the metric

(5) $\rho_L = \rho_{\{X_j\}}$.

The motivation for this definition stems from a proof of positive hypoellipticity
results; see [7].

$\rho_L$ is defined as a metric in the sense of point set topology, but the definition could
be modified to give a Riemannian metric whose properties are equivalent for our
purpose. (2) implies that for any collection of vector fields $X_j$, there exists $C < \infty$
such that $\rho_{\{X_j\}} \le C\rho_0$.

One property of $\rho_L$ is that $L$ is elliptic if and only if $\rho_L$ is comparable (on the
infinitesimal level) to $\rho_0$.



1 In the absence of the bracket hypothesis, there is no satisfactory general description of the
effective symbol. Lemma 3.1 of [7] describes one narrow class of nonsubelliptic operators for which
there exists an effective symbol which is significantly larger than the principal symbol.

Remark. The construction of $\rho_L$ is conformally invariant in the sense that if $L_2 =
r^2 \cdot L_1$ for some constant $r \ne 0$, and if $L_1$ is a subelliptic sum of squares operator,
then $\rho_{L_1} = \rho_{L_2}$.

If $L_1 = \sum_j X_j^2$ and $L_2 = \sum_j (r(x)X_j)^2$ where $0 \ne r \in C^\infty$, then the two metrics
have the same asymptotic behavior, in the sense that their ratio tends to one as the
$\xi$ coordinate tends to infinity through any connected conic open set in which $L_1$ has
strictly subelliptic behavior in the sense that $C^{-1}\rho_0 \ge \rho_{L_1} \ge C\rho_0^{\delta}$ for separated pairs
of points.

There exists another construction which roughly speaking is intermediate between
$\rho_L$ and the more familiar Carnot-Caratheodory metric, and which will help us to
illustrate what phenomenon is measured by $\rho_L$ but not seen by the Carnot-Caratheodory
metric. To any finite family $\{X_j\}$ of smooth real vector fields on a connected open
set $U \subset \mathbb{R}^d$, this construction associates a one parameter family $\varrho_R$ of metrics on $U$,
rather than a single metric on phase space.

Define the auxiliary quantity $\nu : U \times \mathbb{R}^+ \to \mathbb{R}^+$ by

(6) $\nu(x,R) = \min_{|\xi| = R} \sigma(x,\xi)$.

One property of $\nu$ is that for any $R, R' \ge 1$ satisfying $R \sim R'$, one has $\nu(x,R) \sim
\nu(x,R')$, uniformly in $x, R, R'$. If the bracket condition holds to order $m$ at $x$, then
$\nu(x,R) \ge cR^{1/m}$ as $R \to \infty$.

Definition 4. $\varrho_R(p,q)$ is the supremum of $|\psi(p) - \psi(q)|$, taken over all functions $\psi$
satisfying

(7) $|X_j(\psi)(x)| \le \nu(x,R)$ for all $j$

and

(8) $\nu(x,R)R^{-1}|\nabla_x\psi| \le \nu(x,R)$.

An equivalent formulation is that $\varrho_R$ is the Carnot-Caratheodory metric associated
to $\{\nu(x,R)^{-1}X_j\} \cup \{R^{-1}\partial/\partial x_k : 1 \le k \le d\}$. If the bracket condition holds to order
$m$ at every point of $U$, then $\varrho_R(\cdot,\cdot) \ge cR^{1/m}$ on any compact subset of $U \times U$ minus
the diagonal, as $R \to \infty$.

This construction takes a simpler form in the important special case where the
set of vector fields $X_j$ has cardinality $d - 1$ and is everywhere linearly independent.
Then the characteristic variety of $\sum X_j^2$ is a line bundle. It may be easily checked
that a quantity equivalent to $\nu(x,R)$ is obtained by deleting the minimum in (6)
and instead taking $\xi$ to be an element of the fiber of that bundle lying over $x$, of
magnitude $R$.

3. The Question 

The following question may be fundamental. Let $s \in [1,\infty)$. We say that two sets
$\Gamma, \Gamma' \subset T^*\mathbb{R}^d$ are $c$-separated if any two points $p = (x,\xi) \in \Gamma$, $p' = (x',\xi') \in \Gamma'$
satisfy $|x - x'| + (1 + |\xi| + |\xi'|)^{-1}|\xi - \xi'| \ge c$. Let $\{X_j\}$ be a finite collection of $C^\omega$
real vector fields.

Main Question. Consider the condition: For every $c > 0$, there exists $c' > 0$ such
that for any two $c$-separated open sets $\Gamma, \Gamma' \subset T^*\mathbb{R}^d$,

(9) $\rho_L(p,p') \ge c'\rho_0(p,p')^{1/s}$ for all $p \in \Gamma$, $p' \in \Gamma'$.

Is (9) or some closely related condition nearly equivalent to microlocal hypoellipticity
of $\sum X_j^2$ in the Gevrey class $G^s$?

It is essential in (9) that $\Gamma, \Gamma'$ be separated; thus no infinitesimal comparison of
metrics is made.$^2$ Restricting attention to pairs $(p,p')$ for which $\rho_0(p,p') \sim 1$, rather
than merely $\ge 1$, results in an equivalent question.

In the remainder of this paper, the metric $\rho_L$ will be analyzed for a number of
examples, the optimal $C^\infty$, analytic, and Gevrey class hypoellipticity properties of
which are already known for each example but the last. In each known case, the
result will be seen to be consistent with an affirmative answer to this question, with
the word "nearly" omitted.

Nonetheless we have deliberately refrained from posing the question more precisely
or labeling it a conjecture. Some variant of (9) may be more directly related to
hypoellipticity. Some such variants impose stronger restrictions on $\psi$, involving upper
bounds on its partial derivatives of orders greater than one, requiring it to belong
to an appropriate symbol class; thus when $s = 1$, $\psi$ should perhaps be an analytic
symbol as defined for instance in [21]. Our question merely captures a first order
approximation to such a condition. Such variants of (9) are genuinely inequivalent,
as shown in Example 9 below. To the extent of this author's knowledge, such a
stronger variant is indeed satisfied with $s = 1$ for every operator that has so far been
proved to be analytic hypoelliptic.

As will be seen in the subsequent discussion, (9) for s = 1 is related to a conjectured 
criterion for analytic hypoellipticity proposed by Treves [22]. It would be worthwhile 
to further analyze this connection. Our final Example 9 is an interesting test case 
which may distinguish between the two points of view. 

For $C^\infty$ hypoellipticity the analogous condition would be that for every $A < \infty$
there should exist $C_A < \infty$ such that

(10) $\rho_L(p,p') \ge A\log\rho_0(p,p') - C_A$

for all $p \in \Gamma$, $p' \in \Gamma'$, where $\rho_L$ would denote the metric associated to an appropriately
defined effective symbol of $\sum X_j^2$, if a meaningful effective symbol were to exist.

4. Examples 

The notation $A \sim B$ will mean that the quantity $A/B$ is bounded above and
below by positive finite constants, uniformly in all relevant parameters; usually these
parameters will be points in the cotangent bundle or certain subregions thereof, and a
large parameter $\lambda$. Throughout the discussion, the zero section is always understood
to be excluded from any cotangent bundle.

2 When $s = 1$, the effective symbol may be replaced by $(1 + \sum_j \sigma_j^2)^{1/2}$ in the definition of $\rho_L$
without affecting the validity of (9).

Example 1. Let $L = \sum X_k^2$ be an arbitrary sum of squares operator satisfying
the bracket hypothesis to order $m$ at every point of an open set $U \subset \mathbb{R}^d$. Then
$\sigma(x,\xi) \ge c(1 + |\xi|)^{1/m}$. Therefore in the region where $|\xi| \sim \lambda$, the functions $\lambda^{1/m}x_j$
and $\lambda^{-(m-1)/m}\xi_j$ are microlocally Lipschitz, up to a constant factor, uniformly for
large $\lambda$.
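
The microlocal Lipschitz claim can be checked directly against (1) and (2), using only that each $H_{\sigma_i}$ has $\partial/\partial x_k$ coefficients in $S^0_{1,0}$ and $\partial/\partial\xi_k$ coefficients in $S^1_{1,0}$. The following is a sketch under the assumption that the second test function carries the exponent $-(m-1)/m$; the names $\psi_1, \psi_2$ are introduced here only for illustration.

```latex
% Requires amsmath. Region |\xi| \sim \lambda, where \sigma \ge c\lambda^{1/m}.
\begin{align*}
\psi_1 = \lambda^{1/m}x_j : \quad
  & |H_{\sigma_i}\psi_1| = \lambda^{1/m}\,\bigl|\partial\sigma_i/\partial\xi_j\bigr|
    \le C\lambda^{1/m} \le C'\sigma,
  && \langle\xi\rangle^{-1}|\nabla_x\psi_1| \sim \lambda^{1/m-1} \le 1, \\
\psi_2 = \lambda^{-(m-1)/m}\xi_j : \quad
  & |H_{\sigma_i}\psi_2| = \lambda^{-(m-1)/m}\,\bigl|\partial\sigma_i/\partial x_j\bigr|
    \le C\lambda^{-(m-1)/m}\lambda = C\lambda^{1/m} \le C'\sigma,
  && |\nabla_\xi\psi_2| = \lambda^{-(m-1)/m} \le 1 .
\end{align*}
```

Both functions change by an amount comparable to $\lambda^{1/m}$ over a $\rho_0$-distance comparable to $\lambda$ (a unit displacement in $x_j$, or a displacement of size $\lambda$ in $\xi_j$), which is the source of the exponent $1/m$.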

From this it follows that

(11) $\rho_L(p,q) \ge c\rho_0(p,q)^{1/m}$

whenever $\rho_0(p,q) \ge 1$. This is consistent with an affirmative answer to our main
question, a theorem of Derridj and Zuily asserting that such operators are Gevrey
class hypoelliptic of every order $s \ge m$, and the existence of such operators which
are not Gevrey hypoelliptic of any smaller order, such as (13).

On the other hand, for any operator we have $\rho_L \le C\rho_0$. In particular, when $L$ is
elliptic, $\rho_L$ is pointwise comparable to $\rho_0$.

Remark. This conclusion for elliptic operators underscores the contrast between $\rho_L$
and a metric $\tilde\rho_L$ introduced by Fefferman and Parmeggiani [11], [18]. Firstly, the
correspondence between $L$ and $\tilde\rho_L$ has a fundamental monotonicity property: For
any two operators $L_1$ and $L_2$ whose principal symbols are nonnegative and satisfy
$\sigma_1(p) \le \sigma_2(p)$ for every point $p \in T^*\mathbb{R}^n$, one has $\tilde\rho_{L_1}(p,p') \ge \tilde\rho_{L_2}(p,p')$ for all $p, p'$.
The mapping from $L$ to $\rho_L$ has by design no such monotonicity; this reflects the
failure of hypoellipticity of $L_1$ (in various function spaces) to imply hypoellipticity of
$L_2$ when $\sigma_1 \le \sigma_2$, in general. The Carnot-Caratheodory metric associated to $L$ on
the base space $\mathbb{R}^d$ likewise shares this monotonic dependence on $L$.

Secondly, up to order of magnitude, $\rho_L$ is largest when $L$ is elliptic, whereas $\tilde\rho_L$ is
then smallest.

Example 2. The simplest type of obstruction to (9) in the case $s = 1$ is the possible
existence of a nonvanishing vector field $V$ in the span of $\{H_{\sigma_j}\}$, such that some
integral curve $\gamma$ of $V$ is contained in the characteristic variety $\Sigma = \{p : \sigma_j(p) = 0$
for all $j\}$. Treves [20] has conjectured that this obstruction should preclude analytic
hypoellipticity of $L$.

To see that (9) must fail to hold with $s = 1$ in this situation, let $p = (x,\xi)$ and
$p' = (x',\xi')$ be any two distinct points of $\gamma$, and for large $\lambda \in \mathbb{R}^+$ set $\lambda p = (x,\lambda\xi)$
and $\lambda p' = (x',\lambda\xi')$. Consider the integral curve $\gamma_\lambda$ joining $\lambda p$ to $\lambda p'$, parametrized
by $s \mapsto \exp(sV)(\lambda p)$; we have $\lambda p' = \exp(cV)(\lambda p)$, where $c$ is a positive constant
independent of $\lambda$, because the symbols $\sigma_j$ of the underlying vector fields are
homogeneous functions of degree one on $T^*\mathbb{R}^d$. Moreover $\gamma_\lambda$ is contained in $\Sigma$ by hypothesis.
As is well known, this implies a loss of at least one half of a derivative:

$\sigma(x,\xi) \le C(1 + |\xi|)^{1/2} \sim \lambda^{1/2}$

along $\gamma_\lambda$.

Any microlocally Lipschitz function $\psi$ must satisfy $|V(\psi)| \le \sigma \le C\lambda^{1/2}$ along $\gamma_\lambda$.
Consequently $|\psi(\lambda p) - \psi(\lambda p')| \le C\lambda^{1/2}$. Taking the supremum over all such $\psi$ yields

$\rho_{\{X_j\}}(\lambda p, \lambda p') \le C\rho_0(\lambda p, \lambda p')^{1/2}$

as $\lambda \to +\infty$. Consequently (9) fails to be satisfied for $s = 1$.
More precisely, if $\sigma(\lambda q) \le C\lambda^{1/m}$ for every $q$ belonging to $\gamma_\lambda$, then

(12) $\rho_L(\lambda p, \lambda p') \le C\rho_0(\lambda p, \lambda p')^{1/m}$.

A concrete example is the Baouendi-Goulaouic type operator [1] in $\mathbb{R}^3$:

(13) $\partial_x^2 + \partial_y^2 + (x^{m-1}\partial_t)^2$.

Take $p = (0,0,0;0,0,1)$ and $p' = (0,1,0;0,0,1)$ and $V = \partial_y$.
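
To see how the bound (12) arises for (13), one can compute the effective symbol along the curve joining $\lambda p$ to $\lambda p'$. The following is a sketch, with symbols taken up to sign and with coordinates $(x,y,t;\xi,\eta,\tau)$ on $T^*\mathbb{R}^3$ assumed as in Example 7; the shorthand $\operatorname{ad}\sigma_1 = \{\sigma_1, \cdot\,\}$ is introduced here for convenience.

```latex
% Requires amsmath. Coordinates (x,y,t;\xi,\eta,\tau); symbols up to sign.
\begin{align*}
\sigma_1 &= \xi, \qquad \sigma_2 = \eta, \qquad \sigma_3 = x^{m-1}\tau, \\
(\operatorname{ad}\sigma_1)^j\sigma_3 &= \pm\frac{(m-1)!}{(m-1-j)!}\,x^{m-1-j}\tau,
  \qquad 0 \le j \le m-1, \\
\sigma(0,y,0;0,0,\lambda) &\sim |\tau|^{1/m} = \lambda^{1/m}, \\
|\psi(\lambda p) - \psi(\lambda p')| &\le \int_0^1 |\partial_y\psi(0,y,0;0,0,\lambda)|\,dy \le C\lambda^{1/m} .
\end{align*}
```

At $x = 0$ with $\xi = \eta = 0$ only the bracket with $j = m-1$ survives, and it has length $m$, which gives the third line via (3). The last line uses condition (1) for the Hamiltonian field of $\sigma_2 = \eta$, which is $\partial_y$ up to sign. Since $\rho_0(\lambda p, \lambda p') \sim \lambda$, this is (12).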

Remark. There are two different ways to formulate the conjecture of Treves mentioned
in the discussion of the preceding example. One involves curves $\gamma \subset \Sigma$ whose
tangent vectors $\dot\gamma$ belong to the span of $\{H_{\sigma_j}\}$ at each point of $\gamma$; the other involves
curves for which $\dot\gamma$ is orthogonal, with respect to the skew symmetric form defined by
the canonical symplectic form $\omega$, to the tangent space to $\Sigma$ at each point of $\gamma$. These
two conditions on $\gamma$ are in fact equivalent, under the assumption that the collection
of vector fields $\nabla_{x,\xi}\sigma_j$ has constant rank in a neighborhood of $p = \gamma(0)$.

Indeed, $\Sigma$ is by assumption a smooth manifold in a neighborhood of $p$ whose
tangent space is the set of all vectors $v \in T_p(T^*\mathbb{R}^d)$ satisfying $v(\sigma_j) = 0$ for all $j$.
Denote by $W^\perp$ the orthocomplement, with respect to $\omega$, of any subspace $W$ of $T_p\Sigma$.
Since $\omega$ is nondegenerate, $(W^\perp)^\perp = W$ for all subspaces. Define $V$ to be the span of
$\{H_{\sigma_j}(p)\}$, a linear subspace of $T_p\Sigma$.

Let $w = \dot\gamma(0) \in T_p\Sigma$. Then $w \perp_\omega T_p\Sigma$ if and only if $\omega(v,w) = 0$ for all $v$ such
that $v(\sigma_j) = 0$ for all $j$. Since $v(\sigma_j) = \omega(v, H_{\sigma_j})$, the latter condition on $w$ may be
restated as $\omega(v,w) = 0$ for all $v \in V^\perp$. Thus $w \perp_\omega T_p\Sigma$ if and only if $w \in (V^\perp)^\perp = V$.

Remark. A more general conjecture formulated by Treves in [22] involves the
construction of a stratification $\Sigma = \Sigma_0 \supset \Sigma_1 \supset \cdots$, based upon the vanishing not only
of the symbols $\sigma_j$, but also of their iterated Poisson brackets, as well as on quantities
such as the dimension of the span of their gradients and the rank of the quadratic
form defined by the restriction to the tangent space of a stratum of the symplectic
form. One considers curves $\gamma \subset \Sigma_k \setminus \Sigma_{k+1}$ whose tangents are orthogonal to $T\Sigma_k$,
with respect to $\omega$. According to the preceding remark, this means that $\dot\gamma$ belongs
to the span of the set of Hamiltonian vector fields associated to the collection of all
iterated Poisson brackets of the symbols $\sigma_j$ up to a certain order $n$, depending on
$k$. The discussion of the preceding example may be generalized to conclude that the
existence of such a curve $\gamma$ implies a certain upper bound on $\rho_L$, depending on $n$.

Example 3. Suppose that $X_1, \dots, X_{d-1}$ are linearly independent at each point of
$U \subset \mathbb{R}^d$ and, throughout this example, satisfy the bracket hypothesis
to order two at every point of $U$. Then $\rho_L(p,q) \ge c\rho_0(p,q)^{1/2}$ for separated points, as
we have seen in Example 1. In Example 2 it is shown that this lower bound cannot
in general be improved.

If however the characteristic variety $\Sigma = \{(x,\xi) : \sigma_j(x,\xi) = 0$ for all $j\}$ is a
symplectic manifold, then each Hamiltonian vector field $H_{\sigma_j}$ is everywhere transverse to
$\Sigma$. From this is easily deduced the improvement $\rho_L(p,q) \ge c\rho_0(p,q)^1$ for all separated
points. This is consistent with the fundamental theorem of Tartakoff and of Treves,
which asserts analytic hypoellipticity in the symplectic situation.




Now consider the one parameter family of metrics $\varrho_R$ on $U$ of Definition 4. $\Sigma$ is
a line bundle over $U$, and $\nu(x,R) \sim R^{1/2}$ for all $x, R$. From this and the order two
bracket hypothesis it is easily deduced that $\varrho_R(x,y)$ is comparable to $R^{1/2}$ times the
associated Carnot-Caratheodory metric at distances greater than $R^{-1/2}$ with respect
to the latter metric. In particular, $\varrho_R(x,y) \sim R^{1/2}$ for fixed $x \ne y$. This is so,
regardless of whether $\Sigma$ is symplectic. Thus the family $\varrho_R$ fails to detect a property
of the vector fields which is fundamental for analytic hypoellipticity.

Example 4. Let $W \subset \mathbb{R}^d$ be any connected open set. Let $\Gamma$ be a conic open subset of
$T^*W$. Suppose that $L$ is elliptic in $\Gamma$. Then for separated points $p, q$ lying in distinct
topological components of $T^*W \setminus \Gamma$, $\rho_L(p,q) \ge c\rho_0(p,q)$. Indeed, any $\psi \in S^1_{1,0}$ whose
gradient is supported in $\Gamma$ is microlocally Lipschitz, up to multiplication by a bounded
constant factor, and there exist such functions satisfying $|\psi(p) - \psi(q)| \sim \rho_0(p,q)$.

More generally, if $\sigma(x,\xi) \ge C|\xi|^{\delta}$ for all $(x,\xi) \in \Gamma$, then by the same reasoning
$\rho_L(p,q) \ge c\rho_0(p,q)^{\delta}$ for all $p, q$ satisfying the same hypothesis.

The next example concerns $C^\infty$ hypoellipticity. In the $C^\infty$ case one seeks only
to have $\rho_L(p,p') \gg \log\rho_0(p,p')$ for separated $p, p'$. Under the very weak hypothesis
that $L$ has an effective symbol $\sigma(x,\xi)$ which tends to infinity as $|\xi| \to \infty$, this
bound is easily obtained whenever $|\xi|$ and $|\xi - \xi'|$ are large, because the function
$\psi(x,\eta) = \log(|\eta - \xi|)$ is always locally Lipschitz modulo a constant factor, and an
arbitrarily large constant multiple of $\psi$ is Lipschitz relative to $\{X_j\}$ for sufficiently
large $|\xi|$, provided that $\sigma \to \infty$.

Example 5. In $\mathbb{R}^2$ with coordinates $(x,t)$ consider $L = X^2 + Y^2$ where

$X = \partial_x$, $Y = a(x)\partial_t$.

Suppose that $a \in C^\infty$, and that $a(x) = 0$ if and only if $x = 0$. All such operators are
$C^\infty$ hypoelliptic [9], regardless of their degree of degeneracy. Let us see that this is
consistent with our point of view.

The characteristic variety is $\Sigma = \{x = \xi = 0\}$ (which is symplectic). Because of
the remark made two paragraphs above, the condition $\rho_L(p,q) \gg \log\rho_0(p,q)$ is most
likely to fail when $p, q \in \Sigma$ and both have the same $\tau$ coordinate. Thus let $p =
(0,0;0,\tau)$ and $q = (0,1;0,\tau)$ with $\tau$ large and positive. Consider $\psi(x,t;\xi,\tau) = t\tau$.
Certainly $|\psi(p) - \psi(q)| = \tau$ is $\gg \log\rho_0(p,q) \sim \log\tau$, and we claim that $\psi$ is Lipschitz.
Indeed, the two Hamiltonian vector fields are $\partial_x$ and $a(x)\partial_t - \tau a'(x)\partial_\xi$. The former
annihilates $\psi$, while applying the latter to $\psi$ gives $a(x)\tau$, which equals the symbol of
$Y$ and hence is acceptably bounded, without taking into account the existence of an
effective symbol slightly larger than $|\xi| + |a(x)\tau|$.
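
The claim that $\psi = t\tau$ satisfies condition (1) amounts to a one line computation, written out below as a sketch. It takes the symbol of $Y$ to be $a(x)\tau$ up to sign and writes $H_X, H_Y$ for the two Hamiltonian vector fields named above.

```latex
% Requires amsmath. Coordinates (x,t;\xi,\tau).
\begin{align*}
H_X\psi &= \partial_x(t\tau) = 0, \\
H_Y\psi &= a(x)\,\partial_t(t\tau) - \tau a'(x)\,\partial_\xi(t\tau) = a(x)\tau, \\
|H_X\psi| + |H_Y\psi| &= |a(x)\tau| \le |\xi| + |a(x)\tau| .
\end{align*}
```

So (1) holds with the principal symbols alone on the right-hand side, while $|\psi(q) - \psi(p)| = \tau$ grows much faster than $\log\tau$.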

Example 6. Let $(x,t)$ be coordinates in $\mathbb{R}^2$ and $(x,t;\xi,\tau)$ be coordinates in the
cotangent bundle. Consider

(14) $L_1 = \partial_x^2 + (x\partial_t)^2$

(15) $L_2 = \partial_x^2 + (x\partial_t)^2 + (t\partial_t)^2$.

The associated effective symbols are up to order of magnitude equal to $\sigma_1 \sim 1 + |\xi| +
|x\tau| + |\tau|^{1/2}$, and $\sigma_2 \sim 1 + |\xi| + |x\tau| + |\tau|^{1/2} + |t\tau|$. Thus $\sigma_1 \le C\sigma_2$.

The Hamiltonian vector fields associated to the vector fields comprising $L_1$ are $\partial_x$
and $x\partial_t - \tau\partial_\xi$. For $L_2$ there is the additional Hamiltonian field $t\partial_t - \tau\partial_\tau$. Thus
there arise two competing effects in comparing $\rho_{L_2}$ to $\rho_{L_1}$: $\sigma_2$ is larger, but the
presence of an additional Hamiltonian vector field introduces additional constraints
on microlocally Lipschitz functions $\psi$.

To analyze the comparison, let $\lambda \in \mathbb{R}^+$ be a large parameter and consider the
two points $p = (0,0;0,\lambda)$ and $q = (0,0;0,2\lambda)$. The function $\psi(x,t;\xi,\tau) = \tau$ is
microlocally Lipschitz with respect to the metric associated to $L_1$, because it belongs
to $S^1_{1,0}$ and is annihilated by the two Hamiltonian vector fields. Therefore $\rho_{L_1}(p,q) \ge
c\lambda$; it is comparable to the distance between these points in the metric associated to
any elliptic operator, even though $L_1$ fails to be elliptic.

Consider instead $L_2$, and consider the restriction of any microlocally Lipschitz
function $\psi$ to the line segment joining $p$ to $q$. The Hamiltonian vector field $t\partial_t - \tau\partial_\tau$
is tangent to this line segment, and $\sigma_2$ has order of magnitude $\lambda^{1/2}$ on the whole
segment, so the main condition (1) forces $|\psi(p) - \psi(q)| \le C\lambda^{1/2}$. Thus $\rho_{L_2}(p,q) \le
C\lambda^{1/2} \ll \rho_{L_1}(p,q)$. Indeed, Metivier [16] has proved that $L_2$ is hypoelliptic in $G^s$ only
for $s \ge 2$, whereas $L_1$ is $G^s$ hypoelliptic for all $s \ge 1$ by a special case of a theorem
of Grusin [14].
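
The upper bound for $L_2$ is a one dimensional integration along the segment. The following sketch assumes, as stated above, that $\sigma_2 \sim \lambda^{1/2}$ on the whole segment.

```latex
% Requires amsmath. On the segment \{(0,0;0,\tau) : \lambda \le \tau \le 2\lambda\}
% the field t\partial_t - \tau\partial_\tau reduces to -\tau\partial_\tau.
\begin{align*}
\tau\,|\partial_\tau\psi(0,0;0,\tau)| &\le \sigma_2(0,0;0,\tau) \le C\lambda^{1/2}
  \quad\Longrightarrow\quad |\partial_\tau\psi(0,0;0,\tau)| \le C\lambda^{-1/2}, \\
|\psi(q) - \psi(p)| &\le \int_\lambda^{2\lambda} |\partial_\tau\psi(0,0;0,\tau)|\,d\tau \le C\lambda^{1/2} .
\end{align*}
```

By contrast $\rho_0(p,q) \sim \lambda$, so the bound has the form $\rho_{L_2}(p,q) \le C\rho_0(p,q)^{1/2}$.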

Further analysis shows that $\rho_{L_2}(p,q) \ge c\rho_0(p,q)^{1/2}$ for any points $p, q$. $L_2$ is known
to be Gevrey hypoelliptic of order $s$ if and only if $s \ge 2$, so this example is consistent
with an affirmative answer to our main question.

So far we have derived upper bounds on $\rho_L$ by examining the variation of
microlocally Lipschitz functions $\psi$ along integral curves $\gamma_\lambda \subset \Sigma$ of Hamiltonian vector
fields. Our next two examples demonstrate subtler effects due to the influence of one
parameter families of integral curves $\gamma_\lambda$ which do not quite lie in the characteristic
variety $\Sigma$, but merely tend to $\Sigma$ at a certain rate as $\lambda \to \infty$.

Example 7. With coordinates $(x,y,t;\xi,\eta,\tau)$ for $T^*\mathbb{R}^3$, consider

(16) $\partial_x^2 + (x^{k-1}\partial_y)^2 + (x^{m-1}\partial_t)^2$

where $2 \le k \le m$. Here

$\sigma \sim |\xi| + |x^{k-1}\eta| + |x^{m-1}\tau| + |\eta|^{1/k} + |\tau|^{1/m}$.

Consider the associated Hamiltonian vector field $V = x^{k-1}\partial_y - (k-1)x^{k-2}\eta\,\partial_\xi$. Let
$p = (\delta,0,0;0,0,\lambda)$ where $\delta = \lambda^{-1/m}$, and $q = \exp(TV)(p)$ where $T = \delta^{1-k}$. The
integral curve $\gamma$ joining $p$ to $q$ takes the simple form $\exp(sV)(p) = (\delta, s\delta^{k-1}, 0; 0,0,\lambda)$.
In particular, $q = (\delta,1,0;0,0,\lambda)$. Along $\gamma$, $\sigma \sim \lambda^{1/m}$ because $\eta = 0$ there and $\delta$ was
chosen so that $\delta^{m-1}\lambda \sim \lambda^{1/m}$. Thus any microlocally Lipschitz function $\psi$ must satisfy
$|V\psi| \le C\lambda^{1/m}$, whence $|\psi(p) - \psi(q)| \le CT\lambda^{1/m} = C\lambda^{(k-1)/m}\lambda^{1/m} = C\lambda^{k/m}$.

On the other hand, the function $\psi(x,y,t;\xi,\eta,\tau) = \lambda^{k/m}y$ is microlocally Lipschitz
in the region where $|(\xi,\eta,\tau)| \sim \lambda$, and $|\psi(p) - \psi(q)|$ is comparable to $\lambda^{k/m}$. Thus

$\rho_L(p,q) \sim \lambda^{k/m} \sim \rho_0(p,q)^{k/m}$.

This is consistent with both an affirmative answer to our main question, and the
known fact that $L$ is hypoelliptic in the Gevrey class of order $s$ if and only if $s \ge m/k$,
assuming that $m \ge k$ [4].
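
The microlocal Lipschitz property of $\psi = \lambda^{k/m}y$ in Example 7 can be checked by a short case analysis. The sketch below treats only the part of the region where $|\tau| \sim \lambda$, which contains the curve $\gamma$; there $\sigma \ge c(|x|^{m-1}\lambda + \lambda^{1/m})$. The other two Hamiltonian fields, $\partial_x$ and $x^{m-1}\partial_t - (m-1)x^{m-2}\tau\,\partial_\xi$, annihilate $\psi$.

```latex
% Requires amsmath. V = x^{k-1}\partial_y - (k-1)x^{k-2}\eta\,\partial_\xi,\ \psi = \lambda^{k/m}y.
\begin{align*}
|V\psi| &= |x|^{k-1}\lambda^{k/m}, \\
|x| \le \lambda^{-1/m} :\quad |x|^{k-1}\lambda^{k/m}
  &\le \lambda^{-(k-1)/m}\lambda^{k/m} = \lambda^{1/m}, \\
|x| \ge \lambda^{-1/m} :\quad \frac{|x|^{k-1}\lambda^{k/m}}{|x|^{m-1}\lambda}
  &= \bigl(|x|\lambda^{1/m}\bigr)^{k-m} \le 1 \qquad (k \le m), \\
|V\psi| &\le \lambda^{1/m} + |x|^{m-1}\lambda \le C\sigma .
\end{align*}
```

Since $y$ changes by 1 between $p$ and $q$, this gives $|\psi(p) - \psi(q)| = \lambda^{k/m}$, matching the upper bound obtained along $\gamma$.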

Example 8. Fix any integers $m, r \ge 1$. In $\mathbb{R}^2$ consider

(17) $\partial_x^2 + (x^{m-1}\partial_t)^2 + (t^r\partial_t)^2$;

the case $r = 1$ has been discussed above. One of the associated Hamiltonian vector
fields is $V = t^r\partial_t - rt^{r-1}\tau\,\partial_\tau$. The effective symbol has order of magnitude
$|\xi| + |x^{m-1}\tau| + |t^r\tau| + |\tau|^{1/m}$.

Given a large quantity $\lambda$, define $\delta = \lambda^{-(m-1)/mr}$, so that $\delta^r\lambda = \lambda^{1/m}$. Let $p =
(0,\delta;0,\lambda)$ and $q = \exp(-TV)(p)$ where $T = \delta^{1-r}$. Denote by $\gamma$ the segment of
integral curve $\exp(-sV)(p)$, $0 \le s \le T$. For every point of $\gamma$, uniformly in $\lambda$,
$\exp(-sV)(p) = (0,t(s);0,\tau(s))$ where $\delta \ge t(s) \ge c\delta$ and $\lambda \le \tau(s) \le C\lambda$. Moreover
$q = (0,t(T);0,\tau(T))$ where $\tau(T) - \lambda$ is positive and has order of magnitude $\lambda$. By
the choice of $\delta$ and the fact that $|t(s)| \le \delta$ for every point of $\gamma$, $\sigma \le C\lambda^{1/m}$ at every
point of $\gamma$.

Every microlocally Lipschitz function $\psi$ must therefore satisfy $|V\psi| \le C\lambda^{1/m}$ at
every point of $\gamma$, and consequently $|\psi(p) - \psi(q)| \le CT\lambda^{1/m}$. Since $T = \delta^{1-r}$, we
conclude after a small computation that

(18) $\rho_L(p,q) \le C\lambda^{(mr-m+1)/mr}$.

This upper bound on $\rho_L(p,q)$ is also a lower bound, up to a constant factor. To see
this consider the function $\psi(x,t;\xi,\tau) = \lambda^a \cdot \tau$, where $a + 1 = (mr-m+1)/mr \le 1$.
Then in the region where $|(\xi,\tau)| \sim \lambda$, $\psi$ is a symbol in $S^1_{1,0}$ uniformly in $\lambda$, and
$|V\psi| \le C|t|^{r-1}\lambda^{a+1}$. To show that $\psi$ is in this region microlocally Lipschitz relative
to the vector fields in question, up to a bounded factor, we must verify that $|V\psi| \le
C\sigma$. Indeed, this holds because $\sigma(x,t;\xi,\tau) \ge c\lambda^{1/m} + c|t|^r\lambda$ there, and $|t|^{r-1}\lambda^{a+1}$
equals the logarithmically convex average $(|t|^r\lambda)^{\theta} \cdot (\lambda^{1/m})^{1-\theta} \le C\sigma$ where $\theta$ is defined
to be $(r-1)/r$. Thus $\psi$ is microlocally Lipschitz, and

(19) $|\psi(p) - \psi(q)| \sim \lambda^{(mr-m+1)/mr}$,

whence $\rho_L(p,q) \ge c\lambda^{(mr-m+1)/mr}$. It has indeed been proved in [6], as well as in [2]
and [15], that $L$ is $G^s$ hypoelliptic for all $s$ satisfying $s^{-1} \le (mr-m+1)/mr$. In
unpublished work this author has outlined a proof that the threshold $mr/(mr-m+1)$
is optimal, based on the method introduced in [3] and [5].
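
The "small computation" behind (18) is pure exponent bookkeeping, and it can be checked mechanically. The sketch below is illustrative only: the function names `example8_exponents` and `gevrey_threshold` are hypothetical, and each quantity in Example 8 is represented by its exponent as a power of $\lambda$.

```python
from fractions import Fraction


def example8_exponents(m: int, r: int) -> dict:
    """Exponents of lambda for the quantities used in Example 8.

    Every quantity is a power of lambda, so it is stored as an exact
    Fraction exponent. The names are illustrative, not from the paper.
    """
    if m < 1 or r < 1:
        raise ValueError("m and r must be positive integers")
    delta = Fraction(-(m - 1), m * r)  # delta = lambda^{-(m-1)/(mr)}
    sigma = r * delta + 1              # delta^r * lambda, expected to be lambda^{1/m}
    T = (1 - r) * delta                # T = delta^{1-r}
    bound = T + sigma                  # T * lambda^{1/m}, the right side of (18)
    return {"delta": delta, "sigma": sigma, "T": T, "bound": bound}


def gevrey_threshold(m: int, r: int) -> Fraction:
    """Reciprocal of the exponent in (18), i.e. mr / (mr - m + 1)."""
    return 1 / example8_exponents(m, r)["bound"]


if __name__ == "__main__":
    for m, r in [(2, 1), (2, 2), (3, 2)]:
        e = example8_exponents(m, r)
        print(m, r, e["bound"], gevrey_threshold(m, r))
```

For $m = 2$, $r = 1$ this returns the exponent $1/2$ and the threshold 2, which agrees with the treatment of $L_2$ in Example 6.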

Remark. Both a conjectured characterization of analytic hypoellipticity by Treves
[22], and a conjectured sufficient condition for Gevrey hypoellipticity by Bove and
Tartakoff [2] are formulated in terms of properties of a certain stratification of the
characteristic variety $\Sigma$ of $L$. This stratification does not distinguish between different
parameters $r$ in Example 8; $r$ does influence the optimal degree of Gevrey class
hypoellipticity.



HYPOELLIPTICITY: GEOMETRIZATION AND SPECULATION 






5. A Subtler Example 

We next discuss an operator for which analytic hypoellipticity apparently remains
an open question. It is the principal part of the Kohn Laplacian for a three
dimensional pseudoconvex CR manifold, and is of particular interest both because it is one
of the simplest operators whose analytic hypoellipticity is undecided, and because
it illustrates the distinction between two variants of our main question; functions $\psi$
which are merely Lipschitz relative to $\rho_L$ can vary much more rapidly than those
which also belong to the symbol class $S^1_{1,0}$.

Consider the domain $\Im(z_2) > b(z_1)$ in $\mathbb{C}^2$, where $b$ is assumed to be subharmonic
so that this domain will be pseudoconvex. Its boundary may be identified with
$\mathbb{R}^3$, with coordinates $(x,y,t)$, in such a way that a Cauchy-Riemann operator is
$\bar\partial_b = X + iY$ where $X = \partial_x - b_y\partial_t$ and $Y = \partial_y + b_x\partial_t$. Here $b_x = \partial_x b$, $b_y = \partial_y b$, and
$b(x,y) = b(x + iy)$. The Kohn Laplacian is then $(X + iY)(X - iY)$, which equals
$X^2 + Y^2$ modulo a lower order term.$^3$ Analytic or Gevrey hypoellipticity of such an
operator depends only on $\Delta b$, where $\Delta$ denotes the Laplacian in $\mathbb{R}^2$, rather than on
$b$ itself, for any operator may be transformed into any other with the same invariant
$\Delta b$ via a change of variables $(x,y,t) \mapsto (x,y,t - \phi(x,y))$.

Example 9. In $\mathbb{R}^3$ consider $L = X^2 + Y^2$ with

$X = \partial_x - b_y\partial_t$, $Y = \partial_y + b_x\partial_t$

where $b$ is a polynomial depending only on $x, y$, satisfying

$\Delta b(x,y) = x^k + y^k + x^2y^2$,

and $k \ge 6$ is an integer. Define $\lambda(x,y) = \Delta b = x^k + y^k + x^2y^2$.

For general $b(x,y)$ it has been shown by Grigis and Sjostrand [13] that if $\Delta b$
vanishes only at $x = y = 0$ and is a polyhomogeneous function of $(x,y)$, that is,
$\Delta b(r^kx, r^ly) = r^n\Delta b(x,y)$ for some positive integers $k, l, n$, then $L$ is analytic
hypoelliptic. In our example $\lambda = \Delta b$ is polyhomogeneous for $k = 4$, but not for larger
$k$.

Since $[X,Y] = \lambda(x,y)\partial_t$ and $\lambda$ vanishes only on the line $\{(x,y,t) : x = y = 0\}$
in $\mathbb{R}^3$, the bracket hypothesis is satisfied to order 2 everywhere except on that line.
Where $x = y = 0$, it is satisfied to order precisely $m = 6$, because of the presence
of the term $x^2y^2$ in $\lambda$. The characteristic variety $\Sigma \subset T^*\mathbb{R}^3$ of $L$ is a manifold of
codimension two. It is symplectic, at every point at which the principal symbol of
$[X,Y]$ is nonzero. Thus $\Sigma$ is symplectic everywhere except on the submanifold $\Sigma' \subset \Sigma$
given by $\Sigma' = \{(x,y,t;\xi,\eta,\tau) : x = y = \xi = \eta = 0\}$. Hence by virtue of the theorem
of Tartakoff and of Treves, $L$ is microlocally analytic hypoelliptic everywhere except
possibly on $\Sigma'$.

Any particular iterated commutator of $X, Y$ has a principal symbol which either
vanishes identically on $\Sigma'$, or vanishes nowhere on $\Sigma'$. Therefore the stratification
defined by Treves [22] takes the simple form $\Sigma \supset \Sigma' \supset \emptyset$. Because $\Sigma'$ is itself a
symplectic manifold, his conjecture then predicts that $L$ should be analytic hypoelliptic.

3 This lower order term is not insignificant, but the questions of interest regarding $(X+iY)(X-iY)$
concern its microlocal hypoellipticity properties in a subset of the cotangent bundle, where its
properties coincide with those of $X^2 + Y^2$ in all known cases.

For simplicity we set $k = 6$ for the remainder of the discussion. All results
generalize to $k > 6$, but the details of the analysis change slightly.

We will establish two contrasting conclusions concerning this example. Set $p_T =
(0,0,0;0,0,T)$ and $q_T = (0,0,1;0,0,T)$, where $T \in \mathbb{R}^+$ is a large parameter.

Proposition 5.1. There exists $c > 0$ such that for each large $T$ there exists a
function $\varphi_T$ which is microlocally Lipschitz relative to $\{X,Y\}$, and satisfies
$|\varphi_T(q_T) - \varphi_T(p_T)| \ge cT$.

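Proposition 5.2. It is not the case that there exist $c > 0$ and a one parameter family of functions $\varphi_T$ such that for each large $T \in \mathbb{R}^+$, $\varphi_T$ is microlocally Lipschitz relative to $\{X, Y\}$, $|\varphi_T(q_T) - \varphi_T(p_T)| \ge cT$, and moreover

\[ |\partial_{x,y,t}^{\alpha}\, \partial_{\xi,\eta,\tau}^{\beta}\, \varphi_T| \le C_{\alpha,\beta}\,(1 + |(\xi, \eta, \tau)|)^{1 - |\beta|} \quad \text{for all } |\alpha| + |\beta| \le 5. \]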

Thus whether our philosophy predicts analytic hypoellipticity or not depends on precisely which variant of $\rho_L$ we consider. This author ventures no conjecture as to whether this particular operator is analytic hypoelliptic.

We first prove Proposition 5.1. We will construct a single function $\varphi$ which has the required properties simultaneously for all $T$. Consider a function

\[ \varphi(x, y, t; \xi, \eta, \tau) = (t + h)\tau + f \cdot (\xi - \tau b_y) + g \cdot (\eta + \tau b_x) \]

where $f, g, h$ are functions of $x, y$ alone. Our aim is to construct $f, g, h$ so that $\varphi$ is microlocally Lipschitz relative to $\{X, Y\}$. Since $\varphi(q) - \varphi(p) = T - 0 = T$, we can then conclude that $\rho_L(p, q) \ge cT$.

Because the characteristic variety of $L$ is $\{x = y = \xi = \eta = 0\}$ and because the function $(x, y, t; \xi, \eta, \tau) \mapsto \tau$ is annihilated by both $H_X, H_Y$, it is easy to conclude using the same function $\varphi$ that more generally, for any separated points $p, q$, one has

pL(p,q) > cp (p,q). 

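The Hamiltonian vector fields associated to $X, Y$ are

\[ H_X = \partial_x - b_y \partial_t + \tau b_{xy} \partial_\xi + \tau b_{yy} \partial_\eta, \]
\[ H_Y = \partial_y + b_x \partial_t - \tau b_{xx} \partial_\xi - \tau b_{xy} \partial_\eta. \]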

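Thus

\[ H_X\varphi = -\tau b_y - \tau f b_{xy} + \tau g b_{xx} + \tau f b_{xy} + \tau g b_{yy} + \tau h_x + \text{error terms} = \tau[-b_y + g\Delta b + h_x] + \text{error terms}, \]
\[ H_Y\varphi = -\tau f b_{yy} + \tau g b_{xy} + \tau b_x - \tau b_{xx} f - \tau b_{xy} g + \tau h_y + \text{error terms} = \tau[b_x - f\Delta b + h_y] + \text{error terms}. \]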
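As a check on the computation above, the terms abbreviated as error terms can be written out in full. This is a sketch obtained by applying the stated formulas for $H_X, H_Y$ to $\varphi = (t + h)\tau + f \cdot (\xi - \tau b_y) + g \cdot (\eta + \tau b_x)$, with $\Delta b = b_{xx} + b_{yy}$.

```latex
% Assumption: f, g, h depend on (x, y) only, as in the definition of \varphi.
\begin{align*}
H_X\varphi &= \tau\,[-b_y + g\,\Delta b + h_x]
             + f_x\,(\xi - \tau b_y) + g_x\,(\eta + \tau b_x),\\
H_Y\varphi &= \tau\,[\,b_x - f\,\Delta b + h_y]
             + f_y\,(\xi - \tau b_y) + g_y\,(\eta + \tau b_x).
\end{align*}
% The terms \pm\tau f b_{xy} (in H_X\varphi) and \pm\tau g b_{xy} (in H_Y\varphi) cancel.
```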

Each of the aforementioned error terms is a product of one of $f_x, f_y, g_x, g_y$ with either $(\xi - \tau b_y)$ or $(\eta + \tau b_x)$. The latter two quantities are the principal symbols of $X, Y$, hence are dominated by $\sigma$. If $f, g$ are constructed so as to be Lipschitz continuous functions of $(x, y)$, then each error term will be majorized by $\sigma$. Since our goal is to have $|H_X\varphi| + |H_Y\varphi| \le C\sigma$, it therefore suffices to construct Lipschitz functions $f, g$ and a function $h$ such that

\[ b_y - g\Delta b = h_x \quad \text{and} \quad -b_x + f\Delta b = h_y. \tag{20} \]

Recalling that $\Delta b = \lambda$, a necessary condition for (20) is that $b_{yy} - (g\lambda)_y = -b_{xx} + (f\lambda)_x$, which may be rewritten as

\[ \lambda = (f\lambda)_x + (g\lambda)_y. \tag{21} \]

Conversely, if $f, g$ are bounded solutions of (21) then there exists a solution $h$ of (20) in the sense of distributions. But the left hand sides of both equations in (20) are bounded since $b$ is a polynomial, so $h$ is Lipschitz after possible redefinition on a set of measure zero; in fact $h \in C^{1,1}$ when $f, g$ are Lipschitz. Thus in order to show that $\varphi$ is microlocally Lipschitz relative to $\{X, Y\}$, it suffices to prove the existence of Lipschitz continuous (in the ordinary sense) solutions $f, g$ of (21).

Lemma 5.3. There exist Lipschitz continuous functions $f, g$, defined in a neighborhood of the origin in $\mathbb{R}^2$, satisfying $\lambda = (f\lambda)_x + (g\lambda)_y$.

The conclusion is to be interpreted as equality almost everywhere. Note that if $\lambda$ were homogeneous or polyhomogeneous, then there would exist a solution of the form $f = k_1 x$, $g = k_2 y$ for certain constants $k_j$, by Euler's identity. It is a relatively simple matter to construct Lipschitz solutions of $\lambda = f \cdot \lambda_x + g \cdot \lambda_y$ in the present case, but that is not what is required.
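To make the appeal to Euler's identity concrete, here is a sketch for the homogeneous case. The degree $m$ and the constant $\kappa$ are notation introduced only for this illustration, and the form $\lambda = x^2y^2 + x^k + y^k$ is assumed from the explicit expressions used in this example.

```latex
% Assumption: \lambda is homogeneous of degree m, so Euler's identity reads
%   x\lambda_x + y\lambda_y = m\lambda.
\begin{align*}
(\kappa x\lambda)_x + (\kappa y\lambda)_y
  &= \kappa\,(2\lambda + x\lambda_x + y\lambda_y)
   = \kappa\,(m + 2)\,\lambda .
\end{align*}
% Hence f = \kappa x, g = \kappa y solve (21) exactly when \kappa = 1/(m+2).
% Example (k = 4): \lambda = x^2y^2 + x^4 + y^4 has m = 4, giving f = x/6, g = y/6.
```

For $k = 6$ the monomial $x^2y^2$ would require $\kappa = 1/6$ while $x^6$ and $y^6$ would require $\kappa = 1/8$, so no single constant works.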

Proof. We write $A \lesssim B$ to mean that the ratio $A/B$ of nonnegative functions is bounded above by a finite constant, and $A \sim B$ to mean that $A \lesssim B$ and $B \lesssim A$. Different constructions will be used in different regions of the plane. Consider first the region $\Gamma_1$ where $|x| < 2|y|$. We set $g = 0$ and solve $(\tilde f\lambda)_x = \lambda$ by defining

\[ \tilde f(x, y) = \lambda(x, y)^{-1} \int_0^x \lambda(s, y)\,ds. \]

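In this region $\lambda \sim y^6 + x^2y^2$, so $|\int_0^x \lambda(s, y)\,ds| \lesssim |x|\lambda(x, y)$. Hence $|\tilde f(x, y)| \lesssim |x|$. In analyzing $\nabla\tilde f$ it will be useful to note that in $\Gamma_1$,

\[ |\lambda_x| \lesssim \lambda/|x| \quad \text{and} \quad |\lambda_y| \lesssim \lambda/|y|. \]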

To analyze $\nabla\tilde f$ consider first

\[ \tilde f_x = 1 - \lambda^{-2}\lambda_x \cdot \int_0^x \lambda(s, y)\,ds. \]

Since $|\lambda_x| \lesssim \lambda/|x|$ and the integral is $\lesssim |x|\lambda$, the second term is uniformly bounded. Similarly

\[ \tilde f_y = -\lambda^{-2}\lambda_y \int_0^x \lambda(s, y)\,ds + \lambda^{-1} \int_0^x \lambda_y(s, y)\,ds. \]

In absolute value the first term is $\lesssim \lambda^{-2} \cdot (\lambda/|y|) \cdot (|x|\lambda)$, which is uniformly bounded in $\Gamma_1$. The absolute value of the second term is $\lesssim \lambda^{-1} \int_0^{|x|} \lambda/|y|\,ds \lesssim |x|/|y|$, so likewise is bounded.

Fix an auxiliary function $\phi$ which is homogeneous of degree zero in $\mathbb{R}^2$, is $C^\infty$ except at the origin, is identically equal to one where $|x| < |y|/2$, and is supported where $|x| \le |y|$. Set $f = \tilde f \cdot \phi$. This formula makes sense only where $|x| \le |y|$, but we extend the definition by setting $f = 0$ where $|x| > |y|$. Then $f$ is Lipschitz, because $|\nabla\phi(z)| \lesssim |z|^{-1}$ and $|\tilde f(z)| \lesssim |z|$. We claim that where $|y|/2 \le |x| \le |y|$ and $|(x, y)|$ is sufficiently small,

\[ |\nabla^\alpha f(z)| \le C_\alpha |z|^{1 - |\alpha|} \]

for every multi-index $\alpha$. Because $\phi$ is homogeneous and smooth except at the origin, it suffices to verify this for $\tilde f$. One has explicitly

\[ \tilde f(x, y) = \frac{xy^6 + x^3y^2/3 + x^7/7}{y^6 + x^2y^2 + x^6}. \]

In the conic region in question, $\lambda(z) \sim |z|^4$, the terms $x^6, y^6$ in the denominator being comparatively negligible. Hence $|\nabla^\alpha \tilde f(z)| \lesssim |z|^{1 - |\alpha|}$ where $|x| \sim |y|$. The claim follows.
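The explicit formula and the size of $\tilde f$ in the cone can be checked directly. The following is a sketch assuming $\lambda(x, y) = y^6 + x^2y^2 + x^6$ (the case $k = 6$) and writing $z = (x, y)$.

```latex
% Assumption: \lambda(x,y) = y^6 + x^2y^2 + x^6.
\[
\int_0^x \lambda(s,y)\,ds = xy^6 + \tfrac13 x^3y^2 + \tfrac17 x^7 .
\]
% In the cone |y|/2 \le |x| \le |y| the terms of lowest degree dominate
% both numerator and denominator:
\[
\tilde f(x,y)
 = \frac{\tfrac13 x^3y^2\,\bigl(1 + O(|z|^2)\bigr)}{x^2y^2\,\bigl(1 + O(|z|^2)\bigr)}
 = \tfrac13\,x + O(|z|^3).
\]
```

So in that cone $\tilde f$ agrees with the linear function $x/3$ up to terms of higher order, which is consistent with the stated bounds on its derivatives.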

In the (overlapping) region where $|y| < 2|x|$, we solve instead $(\tilde g\lambda)_y = \lambda$, taking $f = 0$. Because $\lambda$ is a symmetric function of $(x, y)$, conclusions parallel to those above may be obtained. Define $g(x, y) = \tilde g(x, y)\phi(y, x)$, and again define $g = 0$ where $|y| > |x|$. We have then $\lambda = (f\lambda)_x + (g\lambda)_y$ where $|x| < |y|/2$ and also where $|y| < |x|/2$.

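Define

\[ \Lambda = \lambda - (f\lambda)_x - (g\lambda)_y. \]

$\Lambda$ is supported in the conic region $\Gamma_3$ where $|y|/2 \le |x| \le 2|y|$. It satisfies $|\nabla^\alpha \Lambda(z)| \lesssim |z|^{4 - |\alpha|}$ for all $z \in \Gamma_3$ and all $\alpha$.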

In order to complete the proof of the lemma, it suffices to construct Lipschitz functions $F, G$ satisfying

\[ \Lambda = (F\lambda)_x + (G\lambda)_y. \]

To accomplish this we set $F = xh$ and $G = yh$ and solve $\Lambda = (xh\lambda)_x + (yh\lambda)_y$ for the single unknown $h$. In polar coordinates $(r, \theta)$ the equation becomes

\[ \Lambda = \lambda r h_r + h \cdot (2\lambda + x\lambda_x + y\lambda_y). \]
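The passage to polar coordinates is a direct application of the product rule. A short worked version, using only the ansatz $F = xh$, $G = yh$ stated above:

```latex
\begin{align*}
(xh\lambda)_x + (yh\lambda)_y
 &= 2h\lambda + \lambda\,(x h_x + y h_y) + h\,(x\lambda_x + y\lambda_y) \\
 &= \lambda\, r h_r + h\,(2\lambda + x\lambda_x + y\lambda_y),
\end{align*}
% using x\partial_x + y\partial_y = r\partial_r in polar coordinates.
```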

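Define

\[ \beta(x, y) = 2 + x\lambda_x/\lambda + y\lambda_y/\lambda \]

and note that $2 \le \beta$ and moreover that in $\Gamma_3$, $|\nabla^\alpha \beta(z)| \lesssim |z|^{-|\alpha|}$. The equation to be solved is

\[ h_r + h\beta/r = r^{-1}\Lambda/\lambda. \tag{22} \]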

It suffices to produce a bounded solution whose gradient is $\lesssim r^{-1}$. Note for future use that

\[ |\beta_\theta(r, \theta)| \lesssim r \]

in $\Gamma_3$, not merely $\lesssim 1$. The fact that $\lambda$ is homogeneous of degree 4 modulo higher order terms and nonvanishing in $\Gamma_3$ leads by Euler's identity to the conclusions $\beta(r, \theta) = 6 + O(r)$ and $\nabla\beta = O(1)$ in that cone. This strengthened bound is essential to our construction.
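Define next

\[ b(r, \theta) = \int_1^r \beta(\rho, \theta)\,\frac{d\rho}{\rho}. \]

Then $b(r, \theta) < 0$ for $r < 1$, and $|b(r, \theta)| \sim \log r^{-1}$. Because $\beta/\rho \ge 2/\rho$, $e^{b(\rho, \theta)} \le \rho^2$ as $\rho \to 0$.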
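The two bounds on $b$ follow by integrating the inequalities for $\beta$. A sketch, assuming that $b(r, \theta)$ is the integral of $\beta(\rho, \theta)/\rho$ from $1$ to $r$ and that $2 \le \beta \le C$ near the origin (the constant $C$ is introduced here for illustration):

```latex
% Assumptions: b(\rho,\theta) = -\int_\rho^1 \beta(s,\theta)\,ds/s and 2 \le \beta \le C.
\[
b(\rho,\theta) \le -2\int_\rho^1 \frac{ds}{s} = 2\log\rho ,
\qquad\text{so}\qquad
e^{b(\rho,\theta)} \le \rho^2 \quad (0 < \rho \le 1),
\]
\[
|b(\rho,\theta)| \le C\int_\rho^1 \frac{ds}{s} = C\log\rho^{-1}.
\]
```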

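Define a solution $h$ of (22) by

\[ h(r, \theta) = e^{-b(r, \theta)} \int_0^r e^{b(\rho, \theta)}\,\frac{\Lambda(\rho, \theta)}{\lambda(\rho, \theta)}\,\frac{d\rho}{\rho}; \]

the integral converges since $\Lambda/\lambda$ is bounded and $e^b \le \rho^2$. Formally $h$ satisfies the required equation. Note that $h$ is supported in $\Gamma_3$, since the integrand vanishes identically outside it.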
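That $h$ formally satisfies (22) can be seen by differentiating the defining formula in $r$. A short verification, assuming $b_r = \beta/r$ as in the definition of $b$:

```latex
\begin{align*}
h_r &= -b_r\,e^{-b(r,\theta)}\int_0^r e^{b(\rho,\theta)}\,
        \frac{\Lambda}{\lambda}(\rho,\theta)\,\frac{d\rho}{\rho}
       \;+\; e^{-b(r,\theta)}\,e^{b(r,\theta)}\,
        \frac{\Lambda}{\lambda}(r,\theta)\,\frac1r \\
    &= -\frac{\beta}{r}\,h + \frac1r\,\frac{\Lambda}{\lambda}.
\end{align*}
% This is equation (22): h_r + h\beta/r = r^{-1}\Lambda/\lambda.
```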

To complete the proof of the lemma it suffices to show that $|h| \lesssim 1$, and that $|\nabla h(r, \theta)| \lesssim r^{-1}$. Because we have seen above that $\Lambda/\lambda$ is bounded, an upper bound for $|h|$ is $e^{-b} \int_0^r e^b\,d\rho/\rho$. Since $b_r$ is between $2/r$ and $C/r$, the factor $e^b$ is monotone increasing, and $e^{b(r/2, \theta)} \le \tfrac14 e^{b(r, \theta)}$. Consequently this last integral has the same order of magnitude as $e^b$, and hence $h$ is uniformly bounded.
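The boundedness of $h$ can also be seen without the dyadic comparison, by bounding the integrand pointwise. This is an alternative sketch under the same assumption $b_r = \beta/r \ge 2/r$, not the argument given in the text:

```latex
% For 0 < \rho \le r:
%   b(r,\theta) - b(\rho,\theta) = \int_\rho^r \beta\,ds/s \ge 2\log(r/\rho).
\[
e^{-b(r,\theta)}\int_0^r e^{b(\rho,\theta)}\,\frac{d\rho}{\rho}
 \le \int_0^r \Bigl(\frac{\rho}{r}\Bigr)^{2}\,\frac{d\rho}{\rho}
 = \frac12 ,
\qquad\text{hence}\qquad
|h| \le \tfrac12\,\sup|\Lambda/\lambda| .
\]
```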

From the boundedness of $h$ and the differential equation (22) it follows immediately that the partial derivative $h_r$ is $\lesssim r^{-1}$ in absolute value. It remains to show that $|h_\theta| \lesssim 1$. Differentiation in the definition of $h$ yields several terms; the simplest is $-b_\theta$ times $h$. We have

\[ |b_\theta| \le \int_r^1 |\beta_\theta|\,\frac{d\rho}{\rho}, \]

and $|\beta_\theta(\rho, \theta)| \le \rho|\nabla\beta| \lesssim \rho$ in $\Gamma_3$, as observed above. Thus $b_\theta$ is uniformly bounded. Hence this simplest term is $\lesssim |h|$, hence uniformly bounded.
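The role of the strengthened bound on $\beta_\theta$ becomes visible when the integral is evaluated. A sketch, with $C$ denoting a constant (introduced here) such that $|\beta_\theta(\rho, \theta)| \le C\rho$:

```latex
% Assumption: |\beta_\theta(\rho,\theta)| \le C\rho in \Gamma_3 for 0 < \rho \le 1.
\[
|b_\theta(r,\theta)|
 \le \int_r^1 |\beta_\theta(\rho,\theta)|\,\frac{d\rho}{\rho}
 \le C\int_r^1 d\rho = C\,(1 - r) \le C .
\]
% With only |\beta_\theta| \le C, the same integral would give C\log r^{-1},
% which is unbounded as r \to 0.
```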

A second term arises when, on differentiating $h$ with respect to $\theta$, the derivative falls upon the factor of $e^b$ inside the integral. An additional factor of $b_\theta$ results, causing no harm since it is uniformly bounded; the analysis indicated above for $h$ itself applies also to this term. The third and last term arises when the derivative falls on the factor of $\Lambda/\lambda$ inside the integral. Now

\[ |\partial_\theta(\Lambda/\lambda)(\rho, \theta)| \le \rho\,|\nabla(\Lambda/\lambda)| \lesssim 1 \]

in $\Gamma_3$, since $|\nabla^\alpha \Lambda| \lesssim |z|^{4 - |\alpha|}$ in $\Gamma_3$, the same holds for $\lambda$, and $|z|^4 \lesssim \lambda(z)$ there. Thus the same analysis applies once more. Hence $|\nabla h| \lesssim r^{-1}$, and the proofs of both the lemma and Proposition 5.1 are complete. □

Proof of Proposition 5.2. Suppose to the contrary that functions $\varphi_T$ possessing the indicated properties were to exist. Note that $H_X, H_Y$ are homogeneous of degree zero with respect to the dilations $(x, y, t; \xi, \eta, \tau) \mapsto (x, y, t; r\xi, r\eta, r\tau)$ for $r \in \mathbb{R}^+$. By applying Arzelà-Ascoli and a rescaling argument based on these dilations and homogeneity to the collection $\{\varphi_T\}$, we could then extract a single function $\varphi$ which in any fixed relatively compact subset of $T^*\mathbb{R}^3$ minus the zero section belonged to $C^5$ and satisfied $\varphi(p) \ne \varphi(q)$, where $p = p_1$, $q = q_1$, and

\[ |H_X\varphi| + |H_Y\varphi| \lesssim |\xi - \tau b_y| + |\eta + \tau b_x|. \tag{23} \]

The vector fields $H_X, H_Y$ are tangent to each level set of $\tau$, so the restriction of $\varphi$ to $\{\tau = 1\}$ must satisfy the preceding inequality. Henceforth we restrict attention to that level set, freezing $\tau = 1$. Since $\varphi(p) \ne \varphi(q)$, there must exist $t$ such that $\partial_t\varphi(0, 0, t; 0, 0, 1) \ne 0$. Both $H_X, H_Y$ are translation invariant with respect to $t$, so we may translate the coordinates so that $\partial_t\varphi(0, 0, 0; 0, 0, 1) \ne 0$.

Next, expanding in Taylor series with respect to $t, \xi, \eta$ about the origin and rewriting the result slightly gives

\[ \varphi(x, y, t; \xi, \eta, 1) = h + \gamma t + f \cdot [\xi - b_y] + g \cdot [\eta + b_x] + E \]

where $E(x, y, t; \xi, \eta) = O(|(t, \xi, \eta)|^2)$ and the coefficients $\gamma, f, g, h$ are functions of $x, y$ alone. Moreover $\gamma(0, 0) \ne 0$ because $\partial_t\varphi \ne 0$; by dividing through by a constant we may assume that $\gamma(0, 0) = 1$.

Applying $H_X, H_Y$, evaluating at $t = \xi = \eta = 0$, and invoking (23) yields now the equations (20) that we arrived at in the proof of Proposition 5.1. Consequently $\lambda = (f\lambda)_x + (g\lambda)_y$.

Consider the Taylor expansions of both sides of this last equation about the origin, and compare terms of equal degrees. By examining terms homogeneous of degree three, one finds immediately that $f(0) = g(0) = 0$. Equality for degree 4 forces $f = c_1 x + O(|(x, y)|^2)$ and $g = c_2 y + O(|(x, y)|^2)$, where $3c_1 + 3c_2 = 1$, in order to reproduce the term $x^2y^2$ on the left without introducing other monomials of degree 4. The contribution to $(f\lambda)_x + (g\lambda)_y$ of the sum of all terms homogeneous of degree 2 in the Taylor expansions of $f, g$ is a homogeneous polynomial of degree 5 plus a remainder which is $O(|(x, y)|^7)$. The part of degree 5 must vanish identically, since no such terms are present in the expansion of $\lambda$. Consider finally the coefficient of $x^6$ on the right. It equals $7c_1 + c_2$, for the monomials $x^2y^2$ and $y^6$ in $\lambda$ cannot possibly lead to any $x^6$ term on the right. Similarly the coefficient of $y^6$ equals $c_1 + 7c_2$. Therefore $7c_1 + c_2 = 1 = c_1 + 7c_2$. The unique solution is $c_1 = c_2 = 1/8$, but this is incompatible with $3c_1 + 3c_2 = 1$. This completes the proof of nonexistence of $\varphi$. □
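The coefficient count at the end of the proof can be written out explicitly. The following sketch keeps only the linear parts $c_1 x$ and $c_2 y$ of $f$ and $g$, assuming $\lambda = x^2y^2 + x^6 + y^6$:

```latex
% Assumption: \lambda = x^2y^2 + x^6 + y^6; only the linear parts of f, g are kept.
\begin{align*}
(c_1 x\lambda)_x + (c_2 y\lambda)_y
 &= 3(c_1 + c_2)\,x^2y^2 + (7c_1 + c_2)\,x^6 + (c_1 + 7c_2)\,y^6 .
\end{align*}
% Matching the coefficients of \lambda requires
%   3(c_1 + c_2) = 1, \quad 7c_1 + c_2 = 1, \quad c_1 + 7c_2 = 1.
% The last two equations give c_1 = c_2 = 1/8, and then 3(c_1 + c_2) = 3/4 \ne 1.
```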

Summary. Although a conjecture linking the growth of $\rho_L$ to hypoellipticity, in various function spaces, is consistent with a wide variety of examples, Example 9 suggests that this conjecture may be overly simplistic. Another reason for wariness is that such a conjecture takes no account of the behavior of the symbols $\sigma_{X_j}$ in the complexified phase space. To decide whether Example 9 is analytic hypoelliptic might be quite illuminating.

Acknowledgement. The author is indebted to Chun Li for a suggestion used in the proof of Lemma 5.3, and to Charles Fefferman for kindly explaining [11] and [18].